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L9 

L10 

Lll 
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ENTERED AT 09:02:40 
0 S AUTOTAXIN# 

0 S Ll/AB 
48 S A2058/AB,BI 
7062 S MELANOMA/AB,BI 
39 S L4 AND L3 

STRACKE, M/AU 
E3 OR E6 
LIOTTA, L/AU 
LIOTTA, L/AU 
E5-9 OR L7 
SCHIFFMANN, E/AU 
E3-6 

L6 OR L8 OR L9 
L10 AND L5 
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60 S L12 OR L12/D 
60 S (MOTILITY FACTOR#)/AB,BI 
31402 S (GROWTH FACTOR#)/AB,BI 
103596 S (TUMOR# OR TUMOUR#)/AB,BI 

° f J*l! ^ (L19 0R L20 °R L21 OR L3) 
-- U3.V awn r n 

- 10 &- C L2 4r AND -£AQ. 


tv*'* ^ 
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0 

58706 

19 

330070 

3872 

1 
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L18 AND L10 
L18 AND L4 

lie and K L 27 OR K daltoh# or kilodalton#)/ab,bi 

L27 1 (Lf L29 122 ° R 123 ° E 124 0R 125 0R 126 0R 

L30 AND L18 

L18 AND (L29) (W) 000/AB.BI 

L18 NOT 1992/PY 


L34 


FILE 'REGISTRY' ENTERED AT 09:24:44 ON 25 FEB 93 
2 S L12 AND SQL=<100 

FILE 'CA' ENTERED AT 09:25:23 ON 25 FEB 93 
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L35 2 S L34 OR L34/D 

L36 12 S Lll AND (PROTEIN# OR PEPTIDE# OR POLYPEPTIDE#) 

=> d bib ab hitrn 131 


ANSWER 1 OF 1 COPYRIGHT 1993 ACS 
CA109(19):164814p 

Isolation and structural characterization of the human 4F2 
heavy-chain gene, an inducible gene involved in T-lymphocyte 
activation 

Gottesdiener, Keith M.; Karpinski, Beverly A.; Lindsten, Tullia; 
Strominger, Jack L.; Jones, Nancy H.; Thompson, Craig B.; Leiden, 
Jeffrey M. 

Howard Hughes Med. Inst., Univ. Michigan 
Ann Arbor, MI 48109, USA 

Mol. cell. Biol., 8(9), 3809-19 C /7 & k 

3-3 (Biochemical Genetics) >»l/** 

13 , 15 ———.. 

J 

MCEBD4 

0270-7306 

1988 


LA Eng 

AB The human 4F2 cell surface antigen is a 120- 
kilodalton (kDa) disulfide-linked heterodimer 

which is composed of an 80- to 90-kDa glycosylated heavy 
chain (4F2HC) and a 35- to 40-kDa nonglycosylated light 
chain (4F2LC). 4F2 belongs to a family of inducible cell surface 
mols. which are involved in T-lymphocyte activation and growth. To 
better understand the mol. mechanism(s) that controls 4F2HC gene 
expression in both resting and activated T cells, a 4F2HC human 
genomic clone was isolated and structurally characterized. The 4F2HC 
gene spans 8 kilobases of chromosome 11 and is composed of nine 
exons. The 5' upstream region of the gene displays several 
properties which are characteristic of housekeeping genes. It is G + 
C rich and hypomethylated in peripheral blood lymphocyte DNA and 
contains multiple binding sites for the Spl transcription factor 
while lacking TATA or CCAAT sequences. This region of the gene also 
displays sequence homologies with several other inducible T-cell 
genes, including the interleukin-2, interleukin-2 receptor .alpha, 
chain, dihydrofolate reductase, thymidine kinase, and transferrin 
receptor genes. A 255-base-pair fragment of the 4F2HC gene which 
contains 154 base pairs of the 5' flanking sequence was able to 
efficiently promote expression of the bacterial chloramphenicol 
acetyltransferase gene in human Jurkat T cells, indicating that it 
contains promoter or enhancer (or both) sequences. Analyses of 
chromatin structure in resting and lectin-activated T cells revealed 
the presence of stable DNase I-hypersensitive sites within both the 
5' flanking and intron 1 regions of the 4F2HC gene. Although the 
4F2HC gene displayed many of the structural features characteristic 
of a constitutively expressed gene, lectin-mediated activation of 
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resting peripheral blood T lymphocytes resulted in a dramatic 
increase in steady-state levels of 4F2HC mRNA. 

IT 111309-64-9 

(amino acid sequence of) 

=> d bib abs hitrn 135 1-2 

L35 ANSWER 1 OF 2 COPYRIGHT 1993 ACS 
AN CA114(7):57704j 

TI The primary structure of DNA binding protein II from the extreme 
thermophilic bacterium Thermus thermophilus 
AU Zierer, Rainer; Choli, Dora 

CS Abt. H-G Wittmann, Max Planck Inst. Mol. Genet. 

LO Berlin, Fed. Rep. Ger. 

SO FEBS Lett., 273(1-2), 59-62 
SC 6-3 (General Biochemistry) 

DT J 
CO FEBLAL 
IS 0014-5793 
PY 1990 
LA Eng 

AN CA114(7):57704j 

AB The primary structure of DNA binding protein II (DNA bp II) from the 
extreme thermophilic bacterium T. thermophilus has been established 
by a combination of manual and automated techniques. The protein has 
95 residues and a mol. mass of 11,843. Comparison of the primary 
structure with the known sequence data of DNA bp II from Clostridium 
pasteurineum, Bacellis stearothemophilus, Escherichia coli, 

Rhizobium meliloti, Anabena, Thermoplasma acidophilum. Pseudomonas 
aeruginosa, and Bacillus caldolyticus reveals a clear homol. among 
these small basic proteins. In particular 2 short sequences in the 
middle and C-terminal part of the proteins (positions 46-51 and 
63-65, resp.) are completely conserved. 

IT 131571-34-1, Protein II (Thermus thermophilus strain HB8 
DNA-binding) 

(amino acid sequence of) 

L35 ANSWER 2 OF 2 COPYRIGHT 1993 ACS 
AN CA111(18):160184b 

TI Outer membrane proteins and nucleotide sequences coding for them in 
vaccines and diagnostic assays for Haemophilus influenzae 
AU Deich, Robert A.; Zlotnick, Gary; Green, Bruce 
CS Praxis Biologies, Inc. 

L0 USA 

SO PCT Int. Appl., 129 pp. 

PI WO 8804932 A1 14 Jul 1988 
DS W: AU, DK, JP, KR 

RW: AT, BE, CH, DE, FR, GB, IT, LU, NL, SE 
AI WO 87-US3423 23 Dec 1987 
PRAI US 86-948364 31 Dec 1986 
US 87-20849 2 Mar 1987 
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IC 

SC 

SX 

DT 

CO 

PY 

LA 

AN 

AB 


US 87-132073 11 Dec 1987 

ICM A61K035-66 

ICS A61K039-00; G01N033-53; G01N033-569 
63-3 (Pharmaceuticals) 

3, 15 
p 

PIXXD2 
1988 
Eng 

CA111(18):160184b 

Peptides and proteins related to an epitope comprisina an outer 

suSh r « e h; Reco “ binant vectors include plasmid DNA and viral DNA 
such as human viruses, animal viruses, insect viruses and 

rJil?ei° Pha ?®2 that direct the expression of the PBOMP-l and PBOMP-2 
related peptides and proteins in appropriate host cells ?he 

of the nuclpnHrfo ___ P re P* Igs for passive immunization. Use 

Ta he»i^L n and erni ? h ^ ln $ rain haart ^sion .Jiu™ con^lO 

7>\i;t p ^: d J n 10 

iFIS rv*" 3 ” 60 ^' 

detergent (e?g T^een 8^^CHAP^Ind^n^^- V? the * rese " ce of 

IT 122983-86-2* f th tlsera P r °tected 100% of the rats. 9 

(peptide, of outer membrane protein of Haemophilus influenzae) 

=> d bib 136 1-12 
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L36 

AN 

TI 

AU 

CS 

LO 

SO 

PI 

DS 


47 pp. 

4 Oct 1990 


AI 

PRAI 


IC 

SC 

SX 

DT 

CO 

PY 

LA 

L36 

AN 

TI 


AU 

CS 

LO 

SO 

PI 

AI 

SC 

SX 

DT 

CO 

PY 

LA 

L3 6 

AN 

TI 

AU 

CS 

LO 

SO 


FR, GB, IT, LU, NL, SE 


ANSWER 1 OF 12 COPYRIGHT 1993 ACS 
CA114(25):243494r 

United states olpt. of Cole^e ' LanCe *''* Krut2sh ' Henry 

USA 

PCT Int. Appl., 

WO 9011287 AI 
W: AU, CA, JP 
RW: AT, BE, CH, DE, DK, ES, 

WO 90-US1526 21 Mar 1990 

US 89-326334 21 Mar 1989 

US 89-380431 17 Jul 1989 

US 89-395453 18 Aug 1989 

AWWOi^s'' C12N ° 01 - 22 ' U12N015-1.5; A61K037-02; A61K039-00; 

7-3 (Enzymes) 

3 
P 

PIXXD2 
1990 
Eng 

ANSWER 2 OF 12 COPYRIGHT 1993 ACS 
CA114(23):221378d 

? V (pro)collagenase-derived peptides as 

N ation4l L ?nS?i^Ls t Sf 1 Hn?th VenS ° n ' Millian '- Krut2sch ' Henry 

USA 

US 317407* A? Pl is NTIS ° rder N °' PAT-APPL-6-317 407. 

US 89-317407 1 Mar 1989 

1-6 (Pharmacology) 

7, 9 
P 

XAXXAV 

1990 

Eng 

ANSWER 3 OF 12 COPYRIGHT 1993 ACS 
CA114(5):40157k 

Tissue inhibitor of metalloproteinases—2 n p»> . , 

tumor ceil lines and human tumor“illues ' ’ expression in 

Stetler-Stevenson, William G.; Brown. Peter D • „ • . 

Levy, Anna T.; Liotta, Lance A ^eter D., Onisto, Maunzio; 

Lab. Pathol., Natl. Cancer Inst. 

Bethesda, MD 20892, USA 
j. Biol. Chem., 265(23), 13933-8 
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sc 14-1 (Mammalian Pathological Biochemistry) 

DT J 
CO JBCHA3 
IS 0021-9258 
PY 1990 
LA Eng 


L36 

AN 

TI 

AU 

CS 

LO 

SO 

SC 

SX 

DT 

CO 

IS 

PY 

LA 


CA114^3^,18521c C ° P¥RIGHT 1993 ACS 

collagenase Characterlza tion of human tumor cell interstitial 

Templeton, Nancy Smyth; Brown, Peter n • t m 

Inger M. K.; Liotta, Lance A ' StetlJ!‘^-o Bvy# Anna .T.; Margulies, 

Lab. Pathol., Natl.'cance? lAst tler " SteVenson ' William 6. 

Bethesda, MD 20892, USA 

Cancer Res., 50(17), 5431-7 

3-3 (Biochemical Genetics) 

7, 14 ' 


J 

CNREA8 

0008-5472 

1990 


Eng 


L36 

AN 

TI 

AU 

CS 

LO 

SO 

SC 

SX 

DT 

CO 

IS 

PY 

LA 


ANSWER 5 OF 12 COPYRIGHT 1993 ACS 
CA113(13):111183h S 


Sulfatide-binding domain of the 
Taraboletti, Giulia; Rao, c. N 
A.; Roberts, David D. 

Pat hol., Natl. Cancer Inst 
Bethesda, MD 20892, USA 

Chem. 265(21), 12253- 
13 '^ enera ^ Biochemistry) 


laminin A chain 
Krutzsch, Henry c. ; Liotta, 


J 

JBCHA3 

0021-9258 

1990 

Eng 


Lance 


L36 

AN 

TI 

AU 

CS 

LO 

SO 

SC 

DT 


ANSWER 6 OF 12 COPYRIGHT 1993 ACS 
CA112(13):112781t ACS 

human^melanoma^ce 11 s° ld increase in 

Bethesda, MD 20892, USA 

Bi ?? hyS * Res * c °romun., 166(2), 757-64 
2 10 (Mammalian Hormones) 

J 
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CO BBRCA9 
IS 0006-291X 
PY 1990 
LA Eng 


L36 

AN 

TI 

AU 

CS 

LO 

SO 

SC 

DT 

CO 

IS 

PY 

LA 


ANSWER 7 OF 12 COPYRIGHT 1993 ACS 
CA112(3):17896a 


The elastin receptor shows structural 
the 67-k Da tumor cell laminin receptor 

Rob f rt P-'• Hinek, Aleksander; 
Robert M.; Liotta, Lance A. 

Dep. Med., Jew. Hosp. 

St. Louis, MO 63110, USA 
J. Bioi. Chem., 264(28), 16652-7 
®“3 (General Biochemistry) 

J 

JBCHA3 

0021-9258 

1989 


and functional similarities to 
Griffin, Gail L.; Senior, 


Eng 


L36 

AN 

TI 

AU 

CS 

LO 

SO 

SC 

SX 

DT 

CO 

IS 

PY 

LA 


ANSWER 8 OF 12 COPYRIGHT 1993 ACS 
CA111 (25):225947Z 

hlH^fi 1 insulin-like growth factor is j 
human melanoma cells 

L --‘ Engel, Jason D.; Wilson, 

,V Lance *•» Schiffman, Elliott 

Lab. Pathol., Natl. Cancer Inst. 

Bethesda, MD 20892, USA 

Ch ?“*' 264 06), 21544-9 

2-10 (Mammalian Hormones) 

14, 15 
J 

JBCHA3 

0021-9258 

1989 

Eng 


motility receptor in 
Lori W.; Rechler, Matthew 


L3 6 

AN 

TI 


AU 

CS 

LO 

SO 

SC 

DT 

CO 


ANSWER 9 OF 12 COPYRIGHT 1993 ACS 
CA110(17):150420r 

Margiu;s? e Jngeri. W j:^^ 0 ?i', K ^ce 0 A' He " ry ^ WaCher ' Mar * P 

Lab. Pathol., Natl. Cancer Inst. 

Bethesda, MD 20892, USA 
J. Biol. Chem., 264(3), 1353-6 
7-5 (Enzymes) 

J 

JBCHA3 



IS 0021-9258 
PY 1989 
LA Eng 
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L3 6 

AN 

TI 


AU 

CS 

LO 

SO 

PI 

AI 

SC 

DT 

CO 

PY 

LA 

L36 

AN 

TI 

AU 

CS 

LO 

SO 

SC 

DT 

CO 

IS 

PY 

LA 


ANSWER 10 OF 12 COPYRIGHT 1993 ACS 
CA108(11):92970r 

Liotta, l. A.; Schiffmann, E. 

United states Dept, of Health and Human Services 

US 583sf • A 5 PP i'Nov°i9!7 AVaU ' MTIS NO ‘ ^T-APPL-7-58381. 

US 87-58381 5 Jun 1987 

15-5 (Immunochemistry) 

XAXXAV 

1987 

Eng 

ANSWER 11 OF 12 COPYRIGHT 1993 ACS 
CA107(13):110769t 

° f the 

Elliott' ^ *' Guirguis ' Raouf; Liotta, Lance A.; Schiffmann, 

Lab. Pathol., Natl. Inst. Health 
Bethesda, MD 20892, USA 

Res- c °“ ,un -' i46 < i >' 339 -« 

BBRCA9 

0006-291X 

1987 

Eng 


L3 6 
AN 
TI 
AU 

CS 

LO 

SO 

SC 

DT 

CO 

IS 

PY 

LA 


COPYRIGHT 1993 ACS 


ANSWER 12 OF 12 
CA105(7):58748c 
Tumor cell autocrine motility factor 

Gordon * RlcharS'r Ma ^ 3er - Ra V a ; Murano, Genesl 

Natl CanceD Tnat'" £ hl J n ;!' Peter K ’ ; Sc hiffman 
d Zv *r ancer Inst., Food Drug Adm. 

Bethesda, MD 20892, USA * 

Proc. Natl. Acad. Sci. u. S. A. 83 fl 0 l 
14-1 (Mammalian Pathological Biochemistry) 


PNASA6 

0027-8424 

1986 

Eng 


Katz, David A.; 
Elliott 
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=> d bib ab hitrn 133 1-10 

L33 ANSWER 1 OF 58 COPYRIGHT 1993 ACS 
AN CA118(1):2872q 

TI Cloning, sequencing and expression of the sialidase gene from 
Actinomyces viscosus DSM 43798 

AU Henningsen, Michaela; Roggentin, Peter; Schauer, Roland 
CS Biochem. Inst., Univ. Kiel 
L0 Kiel W-2300, Germany 

SO Biol. Chem. Hoppe-Seyler, 372(12), 1065-72 
SC 7-5 (Enzymes) 

SX 3, 10 
DT J 
CO BCHSEI 
IS 0177-3593 
PY 1991 
LA Eng 

AB Chromosomal DNA from A. viscosus was digested with restriction 

endonucleases and the fragments ligated with pUC-vectors were used 
to transform Escherichia coli cells. Clones bearing the required 
sialidase gene were detected by spraying the colonies with the 
fluorogenic sialidase substrate 4-methylumbelliferyl-.alpha.-D- 
acetylneuraminic acid. The identity of the cloned sialidase was 
confirmed after 5700-fold enrichment and comparison with the 
purified enzyme of A. viscosus. Both sialidases were identical with 
regard to mol. mass, substrate specificity tested with 
sialyllactoses, and the inhibition of their activity by heterologous 
antisialidase antibodies. The sequenced insert revealed a mol% G + C 
of 68.2, typical for A. viscosus. An open reading frame of 2739 bp 
follows a sequence with dyad symmetry and an AG-rich region, and 
codes for 913 amino acids representing a mol. mass of 113 kDa. The 
conserved amino acid sequence [Ser-X-Asp-X-Gly-X-Thr-Trp] typical 
for bacterial sialidases was found at 5 positions in the predicted 
amino acid sequence. The gene of this enzyme is expressed by E. 
coli, despite the low relatedness of the 2 species. 

IT 144813-79-6, Sialidase (Actinomyces viscosus DSM 43798 
reduced) 

(amino acid sequence of, complete) 

L33 ANSWER 2 OF 58 COPYRIGHT 1993 ACS 
AN CA117(17):167649q 

TI Peptide composition and enzyme activities of isolated pyrenoids from 
the green alga Bryopsis maxima 
AU Okada, M.; Okabe, Y.; Kono, M.; Nakayama, K.; Satoh, H. 

CS Fac. Sci., Toho Univ. 

LO Funabashi 274, Japan 
SO Can. J. Bot., 69(5), 1053-61 

SC 11-1 (Plant Biochemistry) 

SX 3,7 
DT J 
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CO CJBOAW 
IS 0008-4026 
PY 1991 
LA Eng 

AB Pyrenoids of B. maxima contained several minor components other than 
the large subunit (LS) and the small subunit of ribulose 
1,5-bisphosphate carboxylase/oxygenase (Rubisco). Among the minor 
components, polypeptides of 95, 67, and 41 kDa reacted with an 
antibody against the LS polypeptide. Amino acid sequences of these 
polypeptides were detd. and compared with that deduced from the LS 
gene (rbcL) screened from the chloroplast DNA library of B. maxima. 
The N-terminal sequence of the LS peptide was not 
post-translationally processed and was almost identical with those 
of the polypeptides of 91, 67, and 41 kDa. The starch grains 
surrounding the pyrenoids contained a polypeptide of 66 kDa that was 
assigned as starch synthase. 

IT 143861-32-9 143861-35-2 

(amino acid sequence of) 

L33 ANSWER 3 OF 58 COPYRIGHT 1993 ACS 
AN CA117(15):144555s 

TI Nucleotide sequence of the large subunit of ribulose-1,5- 

bisphosphate carboxylase/oxygenase from the green alga Bryopsis 
maxima 

AU Kono, Midori; Satoh, Hiroyuki; Okabe, Yasuyuki; Abe, Yasuko; 

Nakayama, Katsumi; Okada, Mitsumasa 
CS Fac. Sci., Toho Univ. 

LO Funabashi 274, Japan 
SO Plant Mol. Biol., 17(3), 505-8 
SC 3-3 (Biochemical Genetics) 

SX 7, 10, 11 
DT J 
CO PMBIDB 
IS 0167-4412 
PY 1991 
LA Eng 

AB A library was constructed for whole chloroplast DNA of B. maxima. 

The library, screened with the DNA probe of the large subunit (LS) 
of cyanobacterium Rubisco, showed pos. hybridization to 50 out of 
approx. 9000 plaques. .lambda.MHB7, contained the whole rbcL gene of 
B. maxima. The nucleotide sequence contained 1425 bp, encoding 475 
amino acids. The deduced amino acid sequence of B. maxima LS protein 
showed 92% homol. with that of Chlorella LS protein and 90% homol. 
with that of Chlamydomonas LS protein. The amino acid sequences of 
RuDP-binding sites and C02-binding site of B. maxima LS protein 
showed as much of 95% homol. to those of Chlamydomonas. An intron of 
2467 bp was found between codons 268 and 269 of B. maxima rbcL. The 
boundary sequences of the intron most resembled the group III intron 
described for chloroplast DNA. There was an open reading frame of 
822 bp in this intron. It showed no significant gene homol. with the 
sequences in the GenBank database, but its transcribed products were 
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detected by Northern blotting using the whole of the reading frame 
as a probe. Efforts are underway to assess the physiol, significance 
of this region. A sequence located 118 bp upstream from the B. 
maxima rbcL initiation site for translation was identical with that 
of the -10 box of the LS promoter region of rbcL of tobacco, maize 
and spinach. The sequence T-T-G-G-T-T (at the position of 19 bp 
upstream from the -10 box) apparently corresponded to the -35 box. A 
long palindrome sequence (position 4301 to 4346) capable of forming 
a 23 bp stem was in the rbcL termination region. The partial coding 
region was found on the same strand at 310 bp upstream from the rbcL 
coding sequence. Nucleotide sequence homol. was noted between this 
coding region and that of the .alpha, subunit of ATP synthase (atpA) 
of liverwort, pea, spinach and tobacco. 

IT 143637-87-0 143637-89-2 

(amino acid sequence of) 

L33 ANSWER 4 OF 58 COPYRIGHT 1993 ACS 
AN CA117(11):107156s 

TI M13 cloning and nucleotide sequence of the sorghum chloroplast gene 
for the large subunit of ribulose-1,5-bisphosphate carboxylase 
AU Lou, Huan; Zhang, Bin; Qi, Defang 
CS Shanghai Inst. Biochem., Acad. Sin. 

LO Shanghai, Peop. Rep. China 

SO Shengwu Huaxue Yu Shengwu Wuli Xuebao, 21(3), 179-88 
SC 7-5 (Enzymes) 

SX 3, 11 
DT J 
CO SHWPAU 
IS 0582-9879 
PY 1989 
LA Ch 

AB The cloned gene for the large subunit of ribulose-1,5-bisphosphate 
carboxylase (rboL) from sorghum has been sequenced by the method of 
M13/dideoxy chain-termination. Among the total no. of 1839 bp 
sequenced, the coding region of the rbcL gene contains 1428 bp (476 
codons). A sequence GGAGG corresponding to SD sequence occurs 6 to 
10 bp upstream from the initiation codon ATG. The 5' flanking region 
contains sequences resembling the "-10 region" and "-35 region" 
sequences of the prokaryote promoter. The 3' flanking region 
contains a sequence which can form a stem-and-loop structure similar 
to the terminator of the prokaryote gene. Comparison of rboL gene of 
with that of maize shows 97.7% homol. in the coding region, 
and 90% homol. in the noncoding flanking region. However, the rboL 
genes of sorghum and maize differ obviously from those of C3 plants 
(spinach and tobacco). In M13 cloning, restriction enzyme partial 
digestion of clones resulted in identification of recombinant phages 
more simple, rapid and accurate. The sequencing of that DNA region 
was nonrandom and the efficiency was improved. 

IT 143108-32-1 

(amino acid sequence of, complete) 
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L33 ANSWER 5 OF 58 COPYRIGHT 1993 ACS 
AN CA117(9):84434q 

TI Expression of M-cadherin, a member of the cadherin multigene family, 
correlates with differentiation of skeletal muscle cells 
AU Donalies, Michael; Cramer, Matthias; Ringwald, Martin; 

Starzinski-Powitz, Anna 
CS Inst. Genet., Univ. Koeln 

LO Cologne D-5000/1, Germany 

SO Proc. Natl. Acad. Sci. U. S. A., 88(18), 8024-8 
SC 3-3 (Biochemical Genetics) 

SX 6, 13 
DT J 
CO PNASA6 
IS 0027-8424 
PY 1991 
LA Eng 

AB Cadherins, a multigene family of transmembrane glycoproteins, 

mediate Ca2+-dependent intercellular adhesion. They are thought to 
be essential for the control of morphogenetic processes, including 
myogenesis. The identification and characterization of the cDNA of 
another member of the cadherin family, M-cadherin (M for muscle), 
from differentiating muscle cells is reported. The longest open 
reading frame of the cDNAs isolated contains almost the entire 
coding region of the mature M-cadherin as detd. by sequence homol. 
to the known cadherins. M-cadherin mRNA is present at low levels in 
myoblasts and is upregulated in myotube-forming cells. In mouse L 
cells (fibroblasts), M-cadherin mRNA is undetectable. This 
expression pattern indicates that M-cadherin is part of the myogenic 
program and may provide a trigger for terminal muscle 
differentiation. 

IT 142845-03-2, Cadherin M (mouse C-terminal fragment protein 
moiety reduced) 

(amino acid sequence of) 


L3 3 

ANSWER 6 OF 58 COPYRIGHT 1993 ACS 


AN 

CA117(7):64022a 


TI 

Molecular cloning and analysis of small optic 
brain gene of Drosophila melanogaster 

lobes, a structural 

AU 

Delaney, S. J.; Hayward, D. C.; Barleben, F.; 
Miklos, G. L. Gabor 

Fischbach, K. F.; 

CS 

Res. Sch. Biol. Sci., Aust. Natl. Univ. 


LO 

Canberra 2601, Australia 


SO 

Proc. Natl. Acad. Sci. U. S. A., 88(16), 7214- 

-18 

SC 

3-3 (Biochemical Genetics) 


SX 

6 , 12 


DT 

J 


CO 

PNASA6 


IS 

0027-8424 


PY 

1991 


LA 

Eng 


AB 

Mutations in the small optic lobes (sol) gene 

of D. melanogaster 
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cause specific cells to degenerate in the developing optic lobes, 
resulting in the absence of certain classes of columnar neurons. 
These neuronal defects lead to specific alterations in behavioral 
characteristics, particularly during flight and walking maneuvers. 
The wild-type sol locus was isolated by microcloning and chromosomal 
walking and its genetic and mol. limits were established. Two major 
transcripts of 5.8 and 5.2 kb are produced from this locus by 
alternative splicing and are present throughout the entire life 
cycle. Sequence analyses of cDNAs corresponding to these 2 classes 
of transcripts predict 2 proteins of 1597 and 395 amino acids. The 
first shows similarity in its carboxyl-terminal region to the 
catalytic domain of a vertebrate calcium-activated neutral protease 
(calpain), whereas its amino-terminal region contains several 
zinc-finger-like repeats of the form WXCY2CX10-11CX2C. The second 
predicted protein contains only the first 2 of the zinc-finger-like 
repeats and is missing the calpain domain. By constructing 
transgenic flies carrying a single wild-type copy of the sol gene in 
a homozygous sol mutant background, the normal neuroanatomical 
phenotype was restored to individuals that would have developed 
mutant brains. 

IT 142661-94-7, Protein (Drosophila melanogaster clone 

.lambda.cO.22/.lambda.cO.32 gene small—optic—lobes reduced) 

(amino acid sequence of) 

L33 ANSWER 7 OF 58 COPYRIGHT 1993 ACS 
AN CA116(13):122076p 

TI Tick-borne encephalitis virus strain 205: nucleotide sequence of 
genes and complete amino acid sequence of viral proteins 
AU Safronov, P. F.; Netesov, S. V.; Mikryukova, T. P.; Blinov, V. M.; 

Osipova, E. G.; Kiseleva, N. N.; Sandakhchiev, L. S. 

CS VNII Mol. Biol. 

LO USSR 

SO Mol. Genet., Mikrobiol. Virusol., (4), 23-9 
SC 3-2 (Biochemical Genetics) 

SX 6 
DT J 
CO MGMVDU 
IS 0208-0613 
PY 1991 
LA Russ 

AB The 10466 nucleotide long sequence of the cDNA copy of the 

tick-borne encephalitis strain 205 viral genome was detd. It 
includes the 5'-nontranslated region, the genes for structural as 
well as nonstructural proteins and the first 93 nucleotides of 
3'—nontranslated region. The difference in amino acid sequences of 
structural and nonstructural proteins of strains 205, Sofjin and 
Neudoerfl of the tick-borne encephalitis virus and the nucleotide 
changes in 5'- and 3'-nontranslated regions of these strains are 
discussed. 

IT 128808-03-7, Protein M (tick-borne encephalitis virus strain 205 
clone II-71/I-62) 128808-67-3, Glycoprotein E (tick-borne 
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encephalitis virus strain 205 clone II-71/I-62 protein moiety 
reduced) 128808-68-4, Glycoprotein prM (tick-borne encephalitis 
virus strain 205 clone II-71/I-62 precursor protein moiety reduced) 
128808-69-5, Glycoprotein prM (tick-borne encephalitis virus strain 
205 clone II-71/I-62 protein moiety reduced) 139074-91-2, Protein 
C (tick-borne encephalitis virus strain 205 clone 11-71) 

139076-24-7 139076-26-9, Protein NS 1 (tick-borne 

encephalitis virus strain 205 clone III-23/I-3 reduced) 

139076-27-0, Protein NS 5 (tick-borne encephalitis virus strain 205 
clone II-4/III-45/III-38/VI-49 reduced) 139076-28-1, 

Protein NS 3 (tick-borne encephalitis virus strain 205 clone 
1-47/III—30/1-30 reduced) 139076-29-2, Protein NS 2A (tick-borne 
encephalitis virus strain 205 clone 1-3 reduced) 139076-30-5, 
Protein NS 4A (tick-borne encephalitis virus strain 205 clone 1-30 
reduced) 139076-31-6, Protein NS 2B (tick-borne encephalitis virus 
strain 205 clone I-3/III-59 reduced) 139076-32-7, Protein NS 4B 
(tick-borne encephalitis virus strain 205 clone I-30/III-15 reduced) 
(amino acid sequence of) 

L33 ANSWER 8 OF 58 COPYRIGHT 1993 ACS 
AN CA116(1):1417s 

TI The regions of sequence variation in caulimovirus gene VI 

AU Sanger, Margaret; Daubert, Steve; Goodman, Robert M. 

CS Dep. Plant Pathol., Univ. California 
LO Davis, CA 95616, USA 

SO Virology, 182(2), 830-4 

SC 3-2 (Biochemical Genetics) 

SX 6 
DT J 
CO VIRLAX 
IS 0042-6822 
PY 1991 
LA Eng 

AB The sequence of gene VI from figwort mosaic virus (FMV) clone x4 was 

detd. and compared with that previously published for FMV clone DxS. 

Both clones originated from the same virus isolation, but the virus 
used to clone DxS was propagated extensively in a host of a 
different family prior to cloning whereas that used to clone x4 was 
not. Differences in the amino acid sequence inferred from the DNA 
sequences occurred in 2 clusters. An N-terminal conserved region 
preceded 2 regions of variation sepd. by a central conserved region. 
Variation in cauliflower mosaic virus (CaMV) gene VI sequences, all 
of which were derived from virus isolates from hosts from one host 
family, was similar to that seen in the FMV comparison, though the 
extent of variation was less. Alignment of gene VI domains from FMV 
and CaMV revealed regions of amino acid sequence identical in both 
viruses within the conserved regions. The similarity in the pattern 
of conserved and variable domains of these two viruses suggests 
common host-interactive functions in caulimovirus gene VI homologs, 
and possibly an analogy between caulimoviruses and certain animal 
viruses in the influence of the host on sequence variability of 
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viral genes. 

IT 137800-99-8, Protein IBMP (figwort mosaic virus clone DxS 

reduced) 137801-00-4, Protein IBMP (figwort mosaic virus clone x4 
reduced) 

(amino acid sequence of) 

L33 ANSWER 9 OF 58 COPYRIGHT 1993 ACS 
AN CA115(19):199835X 

TI Complete cDNA sequence of a South American isolate of potato virus X 
[Erratum to document cited in CA114(1):1317t] 

AU Orman, Betina, E.; Celnik, Rosana M.; Mandel, Alejandra M.; Torres, 
Hector N.; Mentaberry, Alejandro N. 

CS Inst. Invest. Ing. Genet. Biol. Mol., CONICET 
LO Buenos Aires 1428, Argent. 

SO Virus Res., 19(2-3), 236 

SC 3-2 (Biochemical Genetics) 

SX 6 
DT J 
CO VIREDF 
IS 0168-1702 
PY 1991 
LA Eng 

AB Errors in the genomic sequence in Figure 2 have been cor. The errors 
were reflected in the index entries. 

IT 123514-40-9, Protein (potato virus X strain Xc coat reduced) 
130843-48-0, Protein (potato virus X strain Xc 7.6-kilodalton 
reduced) 130844-13-2, Protein (potato virus X strain Xc 
12 .3-kilodalton reduced) 130844-14-3, Protein (potato 
virus X strain Xc 165-kilodalton reduced) 130844-15-4, Protein 
(potato virus X strain Xc 24.3-kilodalton reduced) 

(amino acid sequence of (Erratum)) 

L33 ANSWER 10 OF 58 COPYRIGHT 1993 ACS 

AN CA115(15):152140b 

TI The recognition component of the N-end rule pathway 
AU Bartel, Bonnie; Wuenning, Ingrid; Varshavsky, Alexander 
CS Dep. Biol., Massachusetts Inst. Technol. 

LO Cambridge, MA 02139, USA 

SO EMBOJ., 9(10), 3179-89 

SC 3-3 (Biochemical Genetics) 

SX 6 

DT J 

CO EMJODG 

IS 0261-4189 

PY 1990 

LA Eng 

AB The N-end rule-based degrdn. signal, which targets a protein for 
ubiquitin-dependent proteolysis, comprises a destabilizing 
amino-terminal residue and a specific internal lysine residue. Here, 
the isolation and functional anal, of a gene (UBR1) for the N-end 
recognizing protein of the yeast Saccharomyces cerevisiae is 
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reported. UBR1 Encodes a .apprx.225-kDa protein with no significant 
sequence similarities to other known proteins. Null ubrl mutants are 
viable but ar unable to degrade the substrates of the N-end rule 
pathway. These mutants are partially defective in sporulation and 
grow slightly more slowly than wild-type. The UBR1 protein 
specifically binds in vitro to proteins bearing amino-terminal 
residues that are destabilizing according to the N-end rule, but 
does not bind to otherwise identical proteins bearing stabilizing 
amino-terminal residues. 

IT 136249-46-2 

(amino acid sequence of) 

=> select hit rn 131 

ENTER ANSWER NUMBER OR RANGE (1):1 

El THROUGH El ASSIGNED 

=> select hit rn 135 1-2 
E2 THROUGH E3 ASSIGNED 

=> select hit rn 133 1-10 
E4 THROUGH E15 ASSIGNED 

=> fil reg 

FILE 'REGISTRY' ENTERED AT 09:40:52 ON 25 FEB 93 

USE IS SUBJECT TO THE TERMS OF YOUR CUSTOMER AGREEMENT 

COPYRIGHT (C) 1993 American Chemical Society (ACS) 

STRUCTURE FILE UPDATES: 19 FEB 93 HIGHEST RN 146074-42-2 
DICTIONARY FILE UPDATES: 24 FEB 93 HIGHEST RN 146074-42-2 


=> s 
L37 

el 

1 

111309-64-9/RN 

=> s 

e2-3 

1 

122983-86-2/RN 



1 

131571-34-1/RN 

L38 


2 

(122983-86-2/RN OR 131571-34-1/RN) 

=> s 

e4-15 

1 

130844-14-3/RN 



1 

136249-46-2/RN 



1 

137800-99-8/RN 



1 

139076-24-7/RN 



1 

139076-28-1/RN 



1 

142661-94—7/RN 



1 

142845-03-2/RN 



1 

143108-32-1/RN 



1 

143637-87-0/RN 



1 

143861-32-9/RN 



1 

143861-35-2/RN 



1 

144813-79—6/RN 
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L39 12 (130844-14-3/RN OR 136249-46-2/RN OR 137800-99-8/RN OR 139 

076-24—7/RN OR 139076-28-1/RN OR 142661-94-7/RN OR 142845- 
03-2/RN OR 143108-32-1/RN OR 143637-87-0/RN OR 143861-32-9 
/RN OR 143861-35-2/RN OR 144813-79-6/RN) 

=> d sqide 137 

L37 ANSWER 1 OF 1 COPYRIGHT 1993 ACS 
RN 111309-64-9 REGISTRY 

CN Antigen 4F2 (human clone pcD-4F2.A heavy chain protein moiety 
reduced) (9CI) (CA INDEX NAME) 

FS PROTEIN SEQUENCE 
SSI Antigen 4F2 
ORGN human 
CLO PCD-4F2.A 
SHG reduced 
SQL 529 

SEQ 1 MSQDTEVDMK EVELNELEPE KQPMNAASGA AMSLAGAEKN GLVKIKVAED 

51 EAEAAAAAKF TGLSKEELLK VAGSPGWVRT RWALLLLFWL GWLGMLAGAV 
101 VIIVRAPRCR ELPAQKWWHT GALYRIGDLQ AFQGHGAGNL AGLKGRLDYL 
151 SSLKVKGLVL GPIHKNQKDD VAQTDLLQID PNFGSKEDFD SLLQSAKKKS 
201 IRVILDLTPN YRGENSWFST QVDTVATKVK DALEFWLQAG VDGFQVRDIE 
251 NLKDASSFLA EWQNITKGFS EDRLLIAGTN SSDLQQILSL LESNKDLLLT 
301 SSYLSDSGST GEHTKSLVTQ YLNATGNRWC SWSLSQARLL TSFLPAQLLR 
351 LYQLMLFTLP GTPVFSYGDE IGLDAAALPG QPMEAPVMLW DESSFPDIPG 

401 AVSANMTVKG QSEDPGSLLS LFRRLSDQRS KERSLLHGDF HAFSAGPGLF 

451 SYIRHWDQNE RFLWLNFGD VGLSAGLQAS DLPASASLPA KADLLLSTQP 

501 GREEGSPLEL ERLKLEPHEG LLLRFPYAA 

MF Unspecified 
Cl MAN 
SR CA 
LC CA 

2 REFERENCES IN FILE CA (1967 TO DATE) 

=> d sqide 138 1-2 

L38 ANSWER 1 OF 2 COPYRIGHT 1993 ACS 
RN 131571-34-1 REGISTRY; 

CN Protein II (Thermus thermophilus strain HB8 DNA-binding) (9CI) (CA 
INDEX NAME) 

FS PROTEIN SEQUENCE 
SSI Protein II 
ORGN Thermus thermophilus 
SQL 95 

SEQ 1 AAKKTVTKAD LVDQVAQATG LKLLDVKAMV DALLAKVEEA LANGSKVQLT 

51 GFGTFEVRKR KARTGVKPGT KEKIKIPATQ YPAFKPGKAL KDKVK 
MF C458 H781 N127 0129 S 
Cl MAN 
SR CA 
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LC CA 
DES 5:ALL,L 

1 REFERENCES IN FILE CA (1967 TO DATE) 

L38 ANSWER 2 OF 2 COPYRIGHT 1993 ACS 
RN 122983-86-2 REGISTRY 

CN L-Alaninamide, L-tyrosyl-L-asparaginyl-L-threonyl-L-valyl-L-tyrosyl- 
L-phenylalanylglycyl-L-phenylalanyl-L-.alpha.-aspartyl-L-lysyl-L- 
tyrosyl-L-.alpha.-aspartyl-L-isoleucyl-L-threonylglycyl-L- 
phenylalanyl-L-tyrosyl-L-valyl-L-threonyl-L-isoleucyl-L-.alpha.- 
aspartyl-L-alanyl-L-.alpha.-aspartyl-L-alanyl-L-alanyl-L-tyrosy1-L- 
leucyl-L-asparaginyl-L-alanyl-L-threonyl-L-prolyl-L-alanyl- (9CI) 

(CA INDEX NAME) 

FS PROTEIN SEQUENCE 
SSI L-Alaninamide 

SBN L-tyrosyl-L-asparaginyl-L-threonyl-L-valyl-L-tyrosyl-L- 

phenylalanylglycyl-L-phenylalanyl-L-.alpha.-aspartyl-L-lysyl-L- 
tyrosyl-L-.alpha.-aspartyl-L-isoleucyl-L-threonylglycyl-L- 
phenylalanyl-L-tyrosyl-L-valy1-L-threonyl-L-isoleucyl-L-.alpha.- 
aspartyl-L-alanyl-L-.alpha.-aspartyl-L-alanyl-L-alanyl-L-tyrosyl-L- 
leucyl-L-asparaginyl-L-alanyl-L-threonyl-L-prolyl-L-alanyl- 
SQL 33 
NTE modified 


type 

- location - 

description 

terminal mod. 

Ala-33 

C-terminal amide 


SEQ 1 YNTVYFGFDK YDITGFYVTI DADAAYLNAT PAA 

MF C173 H241 N37 052 

Cl MAN 

SR CA 

LC CA 

DES 5:ALL,L 

1 REFERENCES IN FILE CA (1967 TO DATE) 

=> d his 140 

(FILE 'REGISTRY' ENTERED AT 09:40:52 ON 25 FEB 93) 

L40 12 S L12 AND L39 

=> d 140 sgide 1-12 

L40 ANSWER 1 OF 12 COPYRIGHT 1993 ACS 
RN 144813-79-6 REGISTRY 

CN Neuraminidase (Actinomyces viscosus strain DSM 43798 reduced) (9CI) 
(CA INDEX NAME) 

FS PROTEIN SEQUENCE 
SSI Neuraminidase 
ORGN Actinomyces viscosus 
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SHG reduced 
SQL 913 


SEQ 1 MTSHSPFSRR 

51 NAPADGLYSV 
101 AGTTKTDCTG 
151 KGATSPVKAN 
201 TESSFDDLGR 
251 TATGTDGATL 
301 LAANTATDNY 
351 RSTDGGKTWS 
401 QGWGGSRGGT 

451 FAASGQGIQI 
501 PIGTGMDENK 
551 KNLPDSVDNA 
601 CDDGASWTTS 
651 NFTMNWLGEQ 
701 HHPPRPSRAL 
751 PETSSAPAAE 
801 ATGAATEPGT 
851 PARRVPRRRR 
901 NTRRAGHPAL 
HITS AT: 419-423 


RLPALLGSLP LAATGLIAAA 
GDVMTFNITL TNTSGEAHSY 
LATHTVTAED LKAGGFTPQI 
SLRVESITPS SSQENYKLGD 
QCHWGGLKPG KGAVYNCKPL 
QTLTATGNPI NWGDHPQAT 
RIPAIPPPPM GTCSSPTTSA 
APTYIHQGTE TGKKVGYSDP 
DPENRGIIQA EVSTSTDNGW 

QHGPHAGRLV QQYTIRTAGG 
WELSDGSLM LNSRASDGSG 
QIIRAFPNAA PDDPRAKVLL 
KVFHEPFVGY TTIAVQSDGS 
CGQKPAEPSP GRRRRRHPQR 
RPSRAGPGAG AHDRSEHGAH 
PTQAPTVAPS VEPTQAPGAQ 
PSSSASPAPS RNAAPTPKPG 
RRRPAAGCLA RDQRAADPGP 
STD 


PPAHAVPTSD GLADVTITQV 
APASTNLSGN VSKCRWRNVP 
AYEVKAVEYA GKALSTPETI 
TVSYTVRVRS VSDKTINVAA 
THTITQADVD AGRWTPSITL 
PAPAPDASTE LPASMSQAQH 
RRTTATAAAT TPNPNHIVQR 
SYWDHQTGT IFNFHVKSYD 
TWTHRTITAD ITKDKPWTAR 

PVQAVSVYSD DHGKTWQAGT 
FRKVAHSTDG GQTWSEPVSD 
LSHSPNPRPW CRDRGTISMS 
IGLLSEDAHN GADYGGIWYR 
HRRRSRPRRP RRALSPRRHR 
TGSCAQSAPE QTDGPTAAPA 
PSSAPKPGAT GRAPSWNPK 
MEPDEIDRPS DGTMAQPTGA 
CGCRGCRRVP AAAGSPFEEL 


MF Unspecified 
Cl MAN 
SR CA 
LC CA 

1 REFERENCES IN FILE CA (1967 TO DATE) 


L40 ANSWER 2 OF 12 COPYRIGHT 1993 ACS 
RN 143861-35-2 REGISTRY 

CN Carboxylase, ribulose diphosphate (Bryopsis maxima chloroplast clone 

.lambda.-DNA7 gene rbcL large subunit reduced) (9CI) (CA INDEX 
NAME) 

FS PROTEIN SEQUENCE 

SSI Ribulose diphosphate carboxylase 

ORGN Bryopsis maxima 

CLO .lambda.-DNA7 

GEN rbcL 

SHG reduced 

SQL 474 


SEQ 1 APKT ETKAGA 

51 ECGAAVAAES 
101 AYPLDLFEEG 
151 PHGIEVERDK 
201 DDENVNSQPF 

251 QRAQCAKELG 
301 DRQKNHGMHF 


GFKAGVKDYR LTYYTPDYQV 
STGTWTTVWT DGLTSLDRYK 
SVTNLFTSIV GTVFGFKALR 
LNKYGRPLLG CTIKPKLGLS 
MRWRDRFLFV AEAIYKSQAE 

VPIIMHDYLT GGWTANTSLA 
RVLAKSLRMS GGDHLHSGTV 


KDTDILAAFR MTPQPGVPPE 
GRCYDLEPVK GEENQYIAYV 
ALRLEDLRIS VAYAKTFQGP 
AKNYGRAVYE CLRGGLDFTK 
TGEIKGHYLN ATAATCEAML 

HYCRDHGLLL HIHRAMHAVI 
VGKLEGEREV TLGFVDLMRD 
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351 DFIEKDRARG IYFTQDWVLL PGVMPVASGG IHVWHMPALV EIFGDDACLQ 
401 FGGGTLGHPW GNAPGAAANR IACEACTQAR NEGRHLAREG GDVIRAACKW 
451 SPELSAACEV LKEIKFEFET IDTL 
HITS AT: 238-242 

MF Unspecified 
Cl PMS, MAN 
PCT Manual registration 
SR CA 
LC CA 

1 REFERENCES IN FILE CA (1967 TO DATE) 

L40 ANSWER 3 OF 12 COPYRIGHT 1993 ACS 
RN 143861-32-9 REGISTRY 

CN Carboxylase, ribulose diphosphate (Bryopsis maxima chloroplast clone 

.lambda.-DNA7 gene rbcL large subunit precursor reduced) (9CI) (CA 
INDEX NAME) 

FS PROTEIN SEQUENCE 

SSI Ribulose diphosphate carboxylase 

ORGN Bryopsis maxima 

CLO .lambda.-DNA7 

GEN rbcL 

SHG reduced 

SQL 475 

SEQ 1 MAPKTETKAG AGFKAGVKDY RLTYYTPDYQ VKDTDILAAF RMTPQPGVPP 

51 EECGAAVAAE SSTGTWTTVW TDGLTSLDRY KGRCYDLEPV KGEENQYIAY 
101 VAYPLDLFEE GSVTNLFTSI VGTVFGFKAL RALRLEDLRI SVAYAKTFQG 
151 PPHGIEVERD KLNKYGRPLL GCTIKPKLGL SAKNYGRAVY ECLRGGLDFT 
201 KDDENVNSQP FMRWRDRFLF VAEAIYKSQA ETGEIKGHYL NATAATCEAM 

251 LQRAQCAKEL GVPIIMHDYL TGGWTANTSL AHYCRDHGLL LHIHRAMHAV 
301 IDRQKNHGMH FRVLAKSLRM SGGDHLHSGT WGKLEGERE VTLGFVDLMR 
351 DDFIEKDRAR GIYFTQDWVL LPGVMPVASG GIHVWHMPAL VEIFGDDACL 
401 QFGGGTLGHP WGNAPGAAAN RIACEACTQA RNEGRHLARE GGDVIRAACK 
451 WSPELSAACE VLKEIKFEFE TIDTL 
HITS AT: 239-243 

MF Unspecified 
Cl PMS, MAN 
PCT Manual registration 
SR CA 
LC CA 

1 REFERENCES IN FILE CA (1967 TO DATE) 

L40 ANSWER 4 OF 12 COPYRIGHT 1993 ACS 
RN 143637-87-0 REGISTRY 

CN Carboxylase, ribulose diphosphate (Bryopsis maxima chloroplast clone 

.lambda.MHB7 large subunit precursor reduced) (9CI) (CA INDEX NAME) 
FS PROTEIN SEQUENCE 
SSI Ribulose diphosphate carboxylase 
ORGN Bryopsis maxima 
CLO .1ambda.MHB7 
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SHG 

SQL 

SEQ 


HITS 

MF 

Cl 

SR 

LC 


L40 

RN 

CN 

FS 

SSI 

ORGN 

CLO 

SHG 

SQL 

SEQ 


HITS 

MF 

Cl 

PCT 

SR 

LC 


reduced 

475 

1 MAPKTETKAG AGFKAGVKDY RLTYYTPDYQ VKDTDILAAF RMTPQPGVPP 
51 EECGAAVAAE SSTGTWTTVW TDGLTSLDRY KGRCYDLEPV KGEENQYIAY 
101 VAYPLDLFEE GSVTNLFTSI VGNVFGFKAL RALRLEDLRI SVAYAKTFQG 
151 PPHGIEVERD KLNKYGRPLL GCTIKPKLGL SAKNYGRAVY ECLRGGLDFT 
201 KDDENVNSQP FMRWRDRFLF VAEAIYKSQA ETGEIKGHYL NATAATCEAM 

251 LQRAQCAKEL GVPIIMHDYL TGGWTANTSL AHYCRDHGLL LHIHRAMHAV 
301 IDRQKNHGMH FRVLAKSLRM SGGDHLHSGT WGKLEGERE VTLGFVDLMR 

351 DDFIEKDRAR GIYFTQDWVL LPGVMPVASG GIHVWHMPAL VEIFGDDACL 

401 QFGGGTLGHP WGNAPGAAAN RIACEACVQA RNEGRHLARE GGDVIRAACK 

451 WSPELAAACE VWKEIKFEFE TIDTL 

AT: 239-243 

Unspecified 
MAN 
CA 
CA 

1 REFERENCES IN FILE CA (1967 TO DATE) 

ANSWER 5 OF 12 COPYRIGHT 1993 ACS 
143108-32-1 REGISTRY 

Carboxylase, ribulose diphosphate (sorghum chloroplast clone pSGB510 
large subunit reduced) (9CI) (CA INDEX NAME) 

PROTEIN SEQUENCE 

Ribulose diphosphate carboxylase 

sorghum 

PSGB510 

reduced 

475 

1 MSPQTETKAS VGFKAGVKDY KLTYYTPEYE TKDTDILAAF RVTPQLGVPP 
51 EEAGAAVAAE SSTGTWTTVW TDGLTSLDRV KGRCYHIEPV PGDPDQYICY 
101 VAYPLDLFEE GSVTNMFVGI VGNVFGFKAL RALRLEDLRI PPAYLKTFQG 
151 PPRGIQIERD KLNKYGRPLL GCTIKPKLGL SAKNYGRACY ECLRGGLDFY 
201 KDDENVNSQP FMRWRDRFVF CAEAIYKAQA ETGEIKGHYL NATAGTCEEM 

251 IKRAVFAKEL GVPIVMHDYL TGGFTANTTL SHYCRDNGLL LHIHRAMHAV 
301 IDRQKNHGMM FRVLAKALRM SGGDHIHSGT WGKLEGERE ITLGFVDLLR 
351 DFIEKDRSRG IFFTQDWVSM PGVIPVASGG IHVWHMPALT EIIGDDSVLQ 
401 FGGGTLGHPW GNAPGAAANR VALEACVQAR NEGRDLAREG NEIIKAACKW 
451 SAELAAACEI WKEIKFDTFK AMDTL 
AT: 239-243 

Unspecified 
PMS, MAN 

Manual registration 

CA 

CA 

1 REFERENCES IN FILE CA (1967 TO DATE) 
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L40 ANSWER 6 OF 12 COPYRIGHT 1993 ACS 
RN 142845-03-2 REGISTRY 

CN . Cadherin M (mouse C-terminal fragment protein moiety reduced) (9CI) 
(CA INDEX NAME) 

FS PROTEIN SEQUENCE 
SSI Cadherin M 
ORGN mouse 
SHG reduced 
SQL 730 

SEQ 1 ENHKRLPYPL VQIKSDKQQL GSVIYSIQGP GVDEEPRNVF SIDKFTGRVY 

51 LNATLDREKT DRFRLRAFAL DLGGSTLEDP TDLEIVWDQ NDNRPAFLQD 

101 VFRGHILEGA IPGTFVTRAE ATDADDPETD NAALRFSILE QGSPEFFSID 
151 EHTGEIRTVQ VGLDREWAV YNLTLQVADM SGDGLTATAS AIISIDDIND 
201 NAPEFTKDEF FMEAAEAVSG VDVGRLEVED KDLPGSPNWV ARFTILEGDP 
251 DGQFKIYTDP KTNEGVLSW KPLDYESREQ YELRVSVQNE APLQAAAPRA 
301 RRGQTRVSVW VQDTNEAPVF PENPLRTSIA EGAPPGTSVA TFSARDPDTE 
351 QLQRISYSKD YDPEDWLQVD GATGRIQTQR VLSPASPFLK DGWYRAIILA 
401 LDNAIPPSTA TGTLSIEILE VNDHAPALAL PPSGSLCSEP DQGPGLLLGA 
451 TDEDLPPHGA PFHFQLNPRV PDLGRNWSVS QINVSHARLR LRHQVSEGLH 
501 RLSLLLQDSG EPPQQREQTL NVTVCRCGSD GTCLPGAAAL RGGGVGVSLG 
551 ALVIVLASTV VLLVLILFAA LRTRFRGHSR GKSLLHGLQE DLRDNILNYD 
601 EQGGGEEDQD AYDINQLRHP VEPRATSRSL GRPPLRRDAP FSYVPQPHRV 
651 LPTSPSDIAN FISDGLEAAD SDPSVPPYDT ALIYDYEGDG SVAGTLSSIL 
701 SSLGDEDQDY DYLRDWGPRF ARLADMYGHQ 
HITS AT: 50-54 

MF Unspecified 
Cl MAN 
SR CA 
LC CA 

1 REFERENCES IN FILE CA (1967 TO DATE) 

L40 ANSWER 7 OF 12 COPYRIGHT 1993 ACS 
RN 142661-94-7 REGISTRY 

CN Protein (Drosophila melanogaster clone .lambda.cO.22/.lambda.cO.32 
gene small-optic-lobes reduced) (9CI) (CA INDEX NAME) 

FS PROTEIN SEQUENCE 
SSI Protein 

ORGN Drosophila melanogaster 
CLO .lambda.cO.22/.lambda.cO.32 
GEN small-optic-lobes 
SHG reduced 
SQL 1597 

SEQ 1 MGTISSVLQW SCTKCNTINP TESLKCFNCG TVRKVFPQQQ QQQHRSSSIT 

51 ASWTADDALE QEQAEKGQER DKEKGRAAVA RSEYKHVYKS LLRGCLKRPQ 
101 RNSQNLPANC VDCEDTRKYI KSSIELYRHF SNPALNRRWV CHACGTDNSS 
151 VTWHCLICDT VSYLAPIYKD AIAADRGQDL AGSLGNRGEL LAADHSHPHH 
201 HHHYLHQELE EQHQHQLHSQ HLHKRHLKGR SASGSGSGPG SGSGLRRTQS 



Celsa 822043 


251 LSTAIDKSAS GRSCHICYAN NQSKDIFNLP QIKPAPQLTG IPPVAACSNS 
301 RFAIANDTFC RRKQNNNNKN QNHKWRESG AKRKYNFTIT TLSRSAAKDA 
351 GHGQMKPLRQ WNLNLNLQQ EPQQKSPANP QQLHRKTQRE PAAVSMNPTQ 
401 FTIPRNGVFI AVNEWSEPMA SSSSVSSSSN HHHHHHSNSN SNSSGNSNII 
451 NNNSSSSSGS NKLYENECVA LAQQQLRAAA AQAAQAAATA VAIASSPSAK 
501 AMAEPAPTAT MPIYAQVNKQ HKLKKKQQIA SESQTNNNTG SGEIADAVSE 
551 SLTAGLGTST DGSGEASESE SQVEEHSIYA KVWKGPRKAT ESKIMHDPGS 
601 SSRLSGAASA AAGTASAGAI AAGVGAAAAS RHDNKTQLGN GSRSKMWICI 
651 KCSYAYNRLW LQTCEMCEAK AEQQQQQLHL QQQQQQQQQH HHHHHHHLQQ 
701 QQAEAPRDEP WTCKKCTLVN YSTAMACWC GGSKLKSISS IEDMTLRKGE 
751 FWTCSHCTLK NSLHSPVCSA CKSHRQPQLS MAMEAVRERP DGQSYEEQDA 
801 AAVGGGGGSA HQSGANEVKA PTALNLPLTS VALPMPMLQL PTSTAAGLRG 
851 SRSPSPRMQL LPSLQQQRNS SSSGAIPKRH STGGSIVPRN ISIAGLANYN 
901 LQQGQGVGSA SWSASGAGS GAGAVGASTS SKKWQCPACT YDNCAASWC 
951 DICSSPRGLA SAVLGEALGR KSVRVALTPA DIRQESKLME NLRQLEETEA 
1001 LTKWQNIIQY CRDNSELFVD DSFPPAPKSL YYNPASGAGE GNPWQWRRP 
1051 HEINCDGGAY PPWAVFRTPL PSDICQGVLG NCWLLSALAV LAEREDLVKE 
1101 VLVTKEICGQ GAYQVRLCKD GKWTTVLVDD LLPCDKRGHL VYSQAKRKQL 
1151 WVPLIEKAVA KIHGCYEALV SGRAIEGLAT LTGAPCESIP LQASSLPMPS 
1201 EDELDKDLIW AQLLSSRCVR FLMGASCGGG NMKVDEEEYQ QKGLRPRHAY 
1251 SVLDVKDIQG HRLLKLRNPW GHYSWRGDWS DDSSLWTDDL RDALMPHGAS 
1301 EGVFWISFED VLNYFDCIDI CKVRSGWNEV RLQGTLQPLC SISCVLLTVL 


1351 EPTEAEFTLF QEGQRNSEKS QRSQLDLCW IFRTRSPAAP EIGRLVEHSK 
1401 RQVRGFVGCH KMLERDIYLL VCLAFNHWHT GIEDPHQYPQ CILAIHSSKR 
1451 LLVEQISPSP HLLADAIISL TLTKGQRHEG REGMTAYYLT KGWAGLWMV 
1501 ENRHENKWIH VKCDCQESYN WSTRGELKT VDSVPPLQRQ VIIVLTQLEG 
1551 SGGFSIAHRL THRLANSRGL HDWGPPGATH CPPIENVHGL HAPRLIT 
HITS AT: 1311-1315 

MF Unspecified 
Cl MAN 
SR CA 
LC CA 

1 REFERENCES IN FILE CA (1967 TO DATE) 

L40 ANSWER 8 OF 12 COPYRIGHT 1993 ACS 
RN 139076-28-1 REGISTRY 

CN Protein NS 3 (tick-borne encephalitis virus strain 205 clone 
I-47/III-30/I-30 reduced) (9CI) (CA INDEX NAME) 

FS PROTEIN SEQUENCE 
SSI Protein NS 3 

ORGN tick-borne encephalitis virus 
CLO I-47/III-30/1-30 
SHG reduced 
SQL 621 

SEQ 1 SDLVFSGQGG RERGDRPFEV KDGVYRIFSP GLLWGQRQVG VGYGSKGVLH 

51 TMWHVTRGAA LSIDDAVAGP YWADVKEDW CYGGAWSLEE KWKGETVQVH 
101 AFPPGRAHEV HQCQPGELLL DTGRRIGAVP IDLAKGTSGS PILNSQGVW 
151 GLYGNGLKTN ETYVSSIAQG EAEKSRPNLP PAVSGTGWTA KGQITVLDMH 
201 PGSGKTHRVL PELIRQCTDR RLRTLVLAPT RWLKEMERA LNGKRVRFHS 
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HITS 

MF 

Cl 

SR 

LC 


L40 

RN 

CN 


FS 

SSI 

ORGN 

CLO 

SHG 

SQL 

SEQ 


251 PAVGDQQVGG AIVDVMCHAT YVNRRLLPQG RQNWEVAIMD EAHWTDPHSI 
301 AARGHLYTLA KENKCALVLM TATPPGKSEP FPESNGAISS EEKQIPDGEW 
351 RDGFDWITEY EGRTAWFVPS IVKGGIIART LRQKGKSVIC LNSKTFEKDY 
401 SRVRDEKPDF WTTDISEMG ANLDVSRVID GRTNIKPEEV DGRVELTGTR 
451 RVTTASAAQR RGRVGRQEGR TDEYIYSGQC DDDDSGLVQW KEAQILLDNI 
501 TTLRGPVATF YGPEQDKMPE VAGHFRLTEE KRKHFRHLLT HCDFTPWLAW 

551 HVAANVSSVT SRNWTWEGPE ENTVDEANGD LVTFRSPNGA ERTLRPVWRD 


601 ARMFREGRDI REFVAYASGR R 
AT: 550-555 

Unspecified 
MAN 
CA 
CA 

1 REFERENCES IN FILE CA (1967 TO DATE) 

ANSWER 9 OF 12 COPYRIGHT 1993 ACS 
139076-24-7 REGISTRY 

Protein, poly- (tick-borne encephalitis virus strain 205 clone 
11-71/1-62/1-3/1-47/111-30/1-30/111-15/11—4/111-45/11-6/VI-49 
reduced) (9CI) (CA INDEX NAME) 

PROTEIN SEQUENCE 
Polyprotein 

tick-borne encephalitis virus 

II-71/I-62/I-3/I-47/III-30/1-30/111-15/11-4/III-45/II-6/VI-49 

reduced 

3414 

1 MAGKAILKGK GGGPPRRVSK ETAKKTRQSR VQMPNGLVLM RMMGILWHAV 
51 AGTARSPVLK SFWNSVPLKQ ATAALRKIKK AVSTLMVGLQ RRGKRRSAVD 
101 WTGWLLVWL IGVTLAATVR KERDGTTVIR AEGKDAATQV RVENGTCVIL 
151 ATDMGSWCDD SLTYECVTID QGEEPVDVDC FCRNVDGVYL EYGRCGKQEG 
201 SRTRRSVLIP SHAQGDLTGR GHKWLEGDSL RTHLTRVEGW VWKNKILTLA 
251 VIAWWLTVE SWTRIAVW VLLCLAPVYA SRCTHLENRD FVTGTQGTTR 
301 VTLVLELGGC VTITAEGKPS MDVWLDSIYQ ENPAKTREYC LHAKLSDTKV 
351 AARCPTMGPA TLAEEHQSGT VCKRDQSDRG WGNHCGLFGK GSIVTCVKAS 
401 CEAKKKATGH VYDANKIVYT VKVEPHTGDY VAANETHSGR KTASFTVSSE 
451 KTILTMGDYG DVSLLCRVAS GVDLAQTVIL ELDKTSEHLP TAWQVHRDWF 
501 NDLALPWRHE GAQNWNNAER LVEFGAPHAV KMDVYNLGDQ TGVLLKSLAG 
551 VPVAHIDGTK YHLKSGHVTC EVGLEKLKMK GLTYTMCDKT KFTWKRTPTD 
601 SGHDTWMEV AFSGTKPCRI PVRAVAHGSP DVNVAMLITP NPTIENNGGG 
651 FIEMQLPPGD NIIYVGELSH QWFQKGSSIG RVFQKTRKGI ERLTVIGEHA 
701 WDFGSTGGFL TSVGKALHTV LGGAFNSLFG GVGFLPKILM GVALAWLGLN 
751 MRNPTMSMGF LLAGGLVLAM TLGVGADVGC AVDTERMELR CGEGLWWRE 
801 VSEWYDNYAY YPETPGALAS AIKETFEEGT CGIVPQNRLE MAMWRSSATE 
851 LNLALAEGDA NLTVWDKLD PTDYRGGIPG LLRKGKDIKV SWKSWGHSMI 
901 WSVPEAPRRF MVGTEGSSEC PLERRKTGVF TVAEFGVGLR TKVFLDFRQE 
951 PTHECDTGVM GAAVKNGMAV HTDQSLWMKS VRNDTGTYIV ELLVTDLRNC 
1001 SWPASHTIDN AEWDSELFL PASLAGPRSW YNRIPGYSEQ VKGPWKYSPI 
1051 RVTREECPGT RVTINADCDK RGASVRSTTE SGKVIPEWCC RTCTLPPVTF 





HITS 

MF 

Cl 
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1101 RTGTDCWYAM EIRPVHDQGG LVRSMWADN GELLSEGGIP GIVALFWLE 
1151 YVIRRRPATG TTAMWGGIW LALLVTGLVK IESLVRYWA VGITFHLELG 
1201 PEIVALTLLQ AVFELRVGLL SAFALRSNLT VREMVTIYFL LLVLELGLPS 
1251 EGLGALWKWG DALAMGALIF RACTAEEKTG VGLLLMALMT QQDLAIAHYG 
1301 LMLFLGVASC YSIWKLIRGH REQKGLTWIV PLAGLLGGEG SGVRLLAFWE 
1351 LAIHGRRRSF SEPLTWGVM LTLASGMMRH TSQEALCALA VASFLLLMLV 
1401 LGTRKMQLVA EWSGCVEWHP ELMNEGGEVS LRVRQDSMGN FHLTELEKEE 
1451 RVMAFWLLAG LAASAFHWSG ILGVMGLWTL SEMLRTARRS DLVFSGQGGR 
1501 ERGDRPFEVK DGVYRIFSPG LLWGQRQVGV GYGSKGVLHT MWHVTRGAAL 
1551 SIDDAVAGPY WADVKEDWC YGGAWSLEEK WKGETVQVHA FPPGRAHEVH 
1601 QCQPGELLLD TGRRIGAVPI DLAKGTSGSP ILNSQGVWG LYGNGLKTNE 
1651 TYVSSIAQGE AEKSRPNLPP AVSGTGWTAK GQITVLDMHP GSGKTHRVLP 
1701 ELIRQCTDRR LRTLVLAPTR WLKEMERAL NGKRVRFHSP AVGDQQVGGA 
1751 IVDVMCHATY VNRRLLPQGR QNWEVAIMDE AHWTDPHSIA ARGHLYTLAK 
1801 ENKCALVLMT ATPPGKSEPF PESNGAISSE EKQIPDGEWR DGFDWITEYE 
1851 GRTAWFVPSI VKGGIIARTL RQKGKSVICL NSKTFEKDYS RVRDEKPDFV 
1901 VTTDISEMGA NLDVSRVIDG RTNIKPEEVD GRVELTGTRR VTTASAAQRR 
1951 GRVGRQEGRT DEYIYSGQCD DDDSGLVQWK EAQILLDNIT TLRGPVATFY 
2001 GPEQDKMPEV AGHFRLTEEK RKHFRHLLTH CDFTPWLAWH VAANVSSVTS 

2051 RNWTWEGPEE NTVDEANGDL VTFRSPNGAE RTLRPVWRDA RMFREGRDIR 
2101 EFVAYASGRR SFGDVLSGMS GVPELLRHRC VSAMDVFYTL MHEEPGSRAM 
2151 KMAERDAPEA FLTVAEMMVL GLATLGWWC FWRTSISRM MLGTLVLLAS 
2201 LALLWAGGVS YGNMAGVALI FYTLLTVLQP EAGKQRSSDD NKLAYFLLTL 
2251 CSLAGLVAAN EMGFLEKTKA DLSTVLWSEH EELRSWEEWT NIDIQPARSW 
2301 GTYVLWSLF TPYMIHQLQT KIQQLVNSAV VTGAQAMRDL GGGAPFFGIA 
2351 GHVMALGWS LVGATPTSLV VGVGLAAFHL AIWSGLEAE LTQRAHKVFF 
2401 SAMVRNPMVD GDVINPFGEG EAKPALYERK MSLVLAIVLC LMSWMNRTV 
2451 PSITEASAVG LAAAGQLLRP EVDTLWTMPV ACGLSGWRG SLWGFLPLGH 
2501 RLWLRASGSR RGGSEGDTLG DLWKRKLNGC TKEEFFAYRR TGILETERDK 
2551 ARELLRRGET NMGLAVSRGT AKLAWLEERG YATLKGEWD LGCGRGGWSY 
2601 YAASRPAVMS VKAYTIGGKG HETPRMVTSL GWNLIKFRAG MDVFSMQPHR 
2651 ADTIMCDIGE SNPDAWEGE RTRKVILLME QWKNRNPTAT CVFKVLAPYR 
2701 PEVIEALHRF QLQWGGGLVR TPFSRNSTHE MYYSTAVTGN IVNSVNIQSR 
2751 KLLARFGDQR GPTRVPELDL GVGTRCWLA EDKVKEKDVQ ERISALREQY 
2801 GETWHMDREH PYRTWQYWGS YRTAPTGSAA SLINGWKLL SWPWNAREDV 
2851 VRMAMTDTTA FGQQRVFKEK VDTKAQEPQP GTKVIMRAVN DWILERLARK 
2901 SKPRMCSREE FIAKVKSNAA LGAWSDEQNR WSSAKEAVED PAFWQLVDEE 
2951 RERHLAGRCA HCVYNMMGKR EKKLGEFGVA KGSRAIWYMW LGSRFLEFEA 
3001 LGFLNEDHWA SRGFSGSGVE GISLNYLGWY LKELSTLEGG LFYADDTAGW 
3051 DTKVTNADLE DEEQLLRYME GEHRQLAATI MQKAYHAKW KVARPSRDGG 
3101 CIMDVITRRD QRGSGQWTY ALNTLTNIKV QLIRMMEGEG VIEASDAHNP 
3151 RLLRVERWLR DHGEERLGRM LVSGDDCWR PVDDRFGRAL YFLNDMAKTR 
3201 KDIGEWEHSV GFSNWEEVPF CSHHFHELVM KDGRALIVPC RDQDELVGRA 
3251 RVSPGCGWSI RETACLSKAY GQMWLLSYFH RRDLRTLGLA ICSAVPVDWV 
3301 PTGRTTWSIH ASGAWMTTED MLDVWNRVWI LDNPFMHSKE KIVEWRDVPY 
3351 LPKSHDMLCS SLVGRKERAE WAKNIWGAVE KVRRMIGQEK FKDYLSCMDR 
3401 HDLHWELKLE SSII 
AT: 2039-2044 

Unspecified 
MAN 
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SR CA 
LC CA 

1 REFERENCES IN FILE CA (1967 TO DATE) 

L40 ANSWER 10 OF 12 COPYRIGHT 1993 ACS 
RN 137800-99-8 REGISTRY 

CN Protein IBMP (figwort mosaic virus clone DxS reduced) (9CI) (CA 
INDEX NAME) 

FS PROTEIN SEQUENCE 

SSI Protein IBMP 

ORGN figwort mosaic virus 

CLO DxS 

SHG reduced 

SQL 512 

SEQ 1 MEELKALRLK EKILEIELNS VKMQIHAYEE SLKATTVNSV QEGEILQTES 

51 IPECPAQGKE TPNPVKADSL LKTILGNERQ NPLEGKSSKL VNLTPKSDKD 
101 KVKSSPVANG SGKDSTKPLN PVALGKSKMT ILGQKQADEE EFKPDYLRAA 
151 SNGQSWFAVY KGPNKEFFTE WEIVADICKK RQKSKRFRSK EQAEVSISLY 


201 NKDIQDPVNF LRPVKLVKEE RAAQPLKFKA IAAEQTIQFD EFRQIWEKSR 
251 LSDLEDGVQE KFYTNDSASK STYTFVENAE PYLVHTAFRA GLAKVIYPSP 
301 NLQELKWFPE GIVKAIKNFR KKVLNAKDAA IFIKIFSSIP DWVQSTRYEP 
351 YHFIQIGIAK TKKELPSSKV CKEEFSVQSL NKVRVQSLQT ISQKLQEINE 
401 ESSIKVNYCS STCIMVSKFQ KKTSTEDLKL VGIFESNLVN IEQLACGDQT 
451 KKEWCRIVRR TYQKHLCLYC KDKADSSSTS GEQNNVEKSC PDSPLTNAYD 
501 ERSDDHKRIP SI 
HITS AT: 192-196 

MF Unspecified 
Cl MAN 
SR CA 
LC. CA 

1 REFERENCES IN FILE CA (1967 TO DATE) 

L40 ANSWER 11 OF 12 COPYRIGHT 1993 ACS 
RN 136249-46-2 REGISTRY 

CN Synthetase, ubiquitin-protein (Saccharomyces cerevisiae clone pUBRl 
reduced) (9CI) (CA INDEX NAME) 

FS PROTEIN SEQUENCE 

SSI Ubiquitin-protein synthetase 

ORGN Saccharomyces cerevisiae 

CLO pUBRl 

SHG reduced 

SQL 1950 

SEQ 1 MSVADDDLGS LQGHIRRTLR SIHNLPYFRY TRGPTERADM SRALKEFIYR 

51 YLYFVISNSG ENLPTLFNAH PKQKLSNPEL TVFPDSLEDA VDIDKITSQQ 
101 TIPFYKIDES RIGDVHKHTG RNCGRKFKIG EPLYRCHECG CDDTCVLCIH 
151 CFNPKDHVNH HVCTDICTEF TSGICDCGDE EAWNSPLHCK AEEQENDISE 
201 DPATNADIKE EDVWNDSVNI ALVELVLAEV FDYFIDVFNQ NIEPLPTIQK 
251 DITIKLREMT QQGKMYERAQ FLNDLKYEND YMFDGTTTAK TSPSNSPEAS 
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301 PSLAKIDPEN YTVIIYNDEY HNYSQATTAL RQGVPDNVHI DLLTSRIDGE 
351 GRAMLKCSQD LSSVLGGFFA VQTNGLSATL TSWSEYLHQE TCKYIILWIT 
401 HCLNIPNSSF QTTFRNMMGK TLCSEYLNAT ECRDMTPWE KYFSNKFDKN 


451 DPYRYIDLSI LADGNQIPLG HHKILPESST HSLSPLINDV ETPTSRTYSN 
501 TRLQHILYFD NRYWKRLRKD IQNVIIPTLA SSNLYKPIFC QQWEIFNHI 
551 TRSVAYMDRE PQLTAIRECV VQLFTCPTNA KNIFENQSFL DIVWSIIDIF 
601 KEFCKVEGGV LIWQRVQKSN LTKSYSISFK QGLYTVETLL SKVHDPNIPL 
651 RPKEIISLLT LCKLFNGAWK IKRKEGEHVL HEDQNFISYL EYTTSIYSII 
701 QTAEKVSEKS KDSIDSKLFL NAIRIISSFL GNRSLTYKLI YDSHEVIKFS 
751 VSHERVAFMN PLQTMLSFLI EKVSLKDAYE ALEDCSDFLK ISDFSLRSW 
801 LCSQIDVGFW VRNGMSVLHQ ASYYKNNPEL GSYSRDIHLN QLAILWERDD 
851 IPRIIYNILD RWELLDWFTG EVDYQHTVYE DKISFIIQQF IAFIYQILTE 
901 RQYFKTFSSL KDRRMDQIKN SIIYNLYMKP LSYSKLLRSV PDYLTEDTTE 
951 FDEALEEVSV FVEPKGLADN GVFKLKASLY AKVDPLKLLN LENEFESSAT 
1001 IIKSHLAKDK DEIAKWLIP QVSIKQLDKD ALNLGAFTRN TVFAKWYKL 
1051 LQVCLDMEDS TFLNELLHLV HGIFRDDELI NGKDSIPEAY LSKPICNLLL 
1101 SIANAKSDVF SESIVRKADY LLEKMIMKKP NELFESLIAS FGNQYVNDYK 
1151 DKKLRQGVNL QETEKERKRR LAKKHQARLL AKFNNQQTKF MKEHESEFDE 
1201 QDNDVDMVGE KVYESEDFTC ALCQDSSSTD FFVIPAYHDH SPIFRPGNIF 
1251 NPNEFMPMWD GFYNDDEKQA YIDDDVLEAL KENGSCGSRK VFVSCNHHIH 
1301 HNCFKRYVQK KRFSSNAFIC PLCQTFSNCT LPLCQTSKAN TGLSLDMFLE 
1351 SELSLDTLSR LFKPFTEENY RTINSIFSLM ISQCQGFDKA VRKRANFSHK 
1401 DVSLILSVHW ANTISMLEIA SRLEKPYSIS FFRSREQKYK TLKNILVCIM 
1451 LFTFVIGKPS MEFEPYPQQP DTVWNQNQLF QYIVRSALFS PVSLRQTVTE 
1501 ALTTFSRQFL RDFLQGLSDA EQVTKLYAKA SKIGDVLKVS EQMLFALRTI 
1551 SDVRMEGLDS ESIIYDLAYT FLLKSLLPTI RRCLVFIKVL HELVKDSENE 
1601 TLVINGHEVE EELEFEDTAE FVNKALKMIT EKESLVDLLT TQESIVSHPY 
1651 LENIPYEYCG IIKLIDLSKY LNTYVTQSKE IKLREERSQH MKNADNRLDF 
1701 KICLTCGVKV HLRADRHEMT KHLNKNCFKP FGAFLMPNSS EVCLHLTQPP 
1751 SNIFISAPYL NSHGEVGRNA MRRGDLTTLN LKRYEHLNRL WINNEIPGYI 
1801 SRVMGDEFRV TILSNGFLFA FNREPRPRRI PPTDEDDEDM EEGEDGFFTE 
1851 GNDEMDVDDE TGQAANLFGV GAEGIAGGGV RDFFQFFENF RNTLQPQGNG 
1901 DDDAPQNPPP ILQFLGPQFD GATIIRNTNP RNLDEDDSDD NDDSDEREIW 
HITS AT: 426-430 

MF Unspecified 
Cl PMS, MAN 
PCT Manual registration 
SR CA 
LC CA 

1 REFERENCES IN FILE CA (1967 TO DATE) 

L40 ANSWER 12 OF 12 COPYRIGHT 1993 ACS 
RN 130844-14-3 REGISTRY 

CN Protein (potato virus X strain Xc 165-kilodalton reduced) (9CI) (CA 
INDEX NAME) 

FS PROTEIN SEQUENCE 
SSI Protein 
ORGN potato virus X 
SHG reduced 
SQL 1456 
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SEQ 1 MAKVREVYQS FTDSTTKTLI QDEAYRNIRP IMEKHKLSNP YAQTIEAAND 

51 LEGFGIATNP YSIELHTHAA AKTIENKLLE VLGSLLPQEP VTFMFLKPRK 
101 LNFMRRNPRI KDIFHNVAIE PRDVARYPKE TIIHKLAEIE TDTAYISDTL 
151 HFLDPSYIVE TFQNCPKLQT LYATLVLPSE AAFKMESTHP NIYSLKYFGD 
201 GFQYIPGNHG GGAYHHEFTD LQWLKVGKIK WRDPKDGLLG HLNYTHEQVD 
251 THTVTVQLQE SFAANHLYCI RRGNMMTPEV RTFGQPDRYV LPPQIILPKV 
301 HNCKKPILKK TMMQLFLYVR TVKVAKNCDI FAKVRQLIKS SDRDKFSAVE 
351 LVYLVSYMEF LAALQATTCF SDTLSGGLLT KTLAPVRAWI QEKKMQLCGL 
401 EDYVKLVKAV DWRPVDFSFK VETWDFRFTQ LGMWKAFQPS ELSDVEEMNN 
451 FFDDGDLLDC FTRMPAYAVN AEEDLAGMRS NNQEETSTAP REPEGEKKEY 
501 INPAETFLDK LTRKHNRETR SRAAKKAKRL AEIQDSMNRD RTEEGSHKTP 
551 NMGEAPSNAD LPGVNEVEAG TTFPTLKALP QKWEDASSTD SSTIDPTEII 
601 PGEEDDKAAT QKVWGLPWK HWLPQLNAVG FKALEIQRDR NGTMIMPITE 
651 MVFGLDKEEF PEGTPEALAR ELKAMNRSPT TIPLDLLRAR DYGSDVKNKR 
701 IGAITKTQAA SWGEYLTGKI ESLPERKVAA CVIHGAGGSG KSHAIQKALR 
751 EIGKGSDITV VLPTNELRLD WSKKVPNTEP YMFKTYEKAL IGGTGSIVIF 
801 DDYSKLPPGY IEALVSFSTK IKLIILTGDS RQSVYHETSD DASIRHLGPA 
851 TEVFAKYCRY YLNATHRNKK DLANMLGVYS ERTGTTEISM SSEFLEGVPT 


901 LVPSDEKRRL YMGTGRNDTF TYAGCQGLTK PKVQIVLDHN TQVCSANVMY 
951 SALSRATDRI NFINTSANSS AFWEKLDSTP YLKTFLSWR EHALKEYEPA 
1001 EAEPIKEPEP QTHMCVENEE SVLEEYKEEL LEKFDREIHS DAHGHSNCVQ 
1051 TEDTTIQLFS HQEAKDETLL WATIDARLKT SNQESNFREF LSKRDIGDVL 
1101 FLNYQKAMGL PKEPIPFSQE VWEACAHEVQ SKYLSKSKCN LINGTVRQSP 
1151 DFDENKIMVF LKSQWVTKVE KLGLPKIKPG QTIAAFYQQT VMLFGTMARY 
1201 MRWFRQAFQP KEVFINCETT PEDMSAWALS NWNFTRPSLA NDYTAFDQSQ 
1251 DGAMLQFEVL KAKHHCIPEE IIQAYIDIKT HAQIFLGYLS IMRLTGEGPT 
1301 FDANTECNIA FTHYKFDIPA GTAQVYAGDD SALDCVPEVK QSFHRLEDKL 
1351 LLKSKPVITQ QKKGSWPEFC GWLITPKGVM KDPIKLHVSL KLAEAKGELR 
1401 KCQDSYEIDL SYAYDHKDSL NDLFDEKQCQ AHTLTCRTLI KSGRGTVSLP 
1451 RLKNFL 
HITS AT: 861-865 

MF Unspecified 
Cl MAN 
SR CA 
LC CA 

2 REFERENCES IN FILE CA (1967 TO DATE) 




